Efficient Support for Two-Dimensional Data Distributions in Distributed Shared Memory Systems

نویسندگان

  • David K. Lowenthal
  • Vincent W. Freeh
  • David W. Miller
چکیده

Despite their clear advantage in scalability, twodimensional data distributions are not efficiently supported by current software distributed shared memory (SDSM) systems. This is because sharing between nodes occurs on both columns and rows. Sharing in two dimensions is not a good match for SDSM systems, because either a rowor columnmajor data layout of pages leads to (1) severe thrashing, if a strong memory consistency is used, or (2) exchange of unnecessary data between nodes, if a relaxed memory consistency is used. This paper examines two alternatives for efficiently supporting two-dimensional data distributions in SDSM systems. We develop two new page consistency protocols for this purpose. One protocol, called Explicit-2D, requires that the user or compiler explicitly identify truly shared elements within a page; the other, called Implicit-2D, infers such elements implicitly. Knowledge of truly shared elements allows the SDSM, at synchronization points, to send only truly shared data, which reduces diff sizes. As the problem size or the number of nodes grows, programs written using a two-dimensional distributions with our new protocols are superior to those using a one-dimensional one. The difference in our tests is as much as 12% for Red-Black SOR, and increases with the problem size and number of nodes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Integrated Compiler/Run-Time System for Global Data Distribution in Distributed Shared Memory Systems∗

A software distributed shared memory (DSM) provides the illusion of shared memory on a distributed-memory machine; communication occurs implicitly via page faults. For efficient execution of DSM programs, the threads and their implicitly associated data must be distributed to the nodes to balance the computational workload and minimize communication due to page faults. The focus of this paper i...

متن کامل

SUIF-Adapt: An Integrated Compiler/Run-Time System for Global and Dynamic Data Distributions

Distributing data is one of the key problems in implementing efficient distributed-memory parallel programs. The problem is especially difficult in programs where (1) data redistribution between computational phases is considered or (2) the participating processors (nodes) executing a parallel application are not dedicated. In either case, the commonly used BLOCK and CYCLIC distributions no lon...

متن کامل

A Causal Logging Scheme for Lazy Release Consistent Distributed Shared Memory Systems

This paper presents a causal logging scheme for the lazy release consistent distributed shared memory systems. Causal logging is a very attractive approach to provide the fault tolerance for the distributed systems, since it eliminates the need of the stable logging. However, since interprocess dependency must causally be transferred with the normal messages, the excessive message overhead has ...

متن کامل

SilkRoad: A Multithreaded Runtime System with Software Distributed Shared Memory for SMP Clusters

Multithreaded parallel system with software Distributed Shared Memory (DSM) is an attractive direction in cluster computing. In these systems, distributing workloads and keeping the shared memory operations efficient are critical issues. Distributed Cilk (Cilk 5.1) is a multithreaded runtime system for SMP clusters with the support of divide-and-conquer programming paradigm. However, there is n...

متن کامل

System Software Support for Reducing Memory Latency on Distributed Shared Memory Multiprocessors

This paper overviews results from our recent work on building customized system software support for Distributed Shared Memory Multiprocessors. The mechanisms and policies outlined in this paper are connected with a single conceptual thread: they all attempt to reduce the memory latency of parallel programs by optimizing critical system services, while hiding the complex architectural details o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002